Generation of Synthetic Transcriptome Data with Defined Statistical Properties for the Development and Testing of New Analysis Methods
نویسندگان
چکیده
We have previously developed a combined signal/variance distribution model that accounts for the particular statistical properties of datasets generated on the Applied Biosystems AB1700 transcriptome system. Here we show that this model can be efficiently used to generate synthetic datasets with statistical properties virtually identical to those of the actual data by aid of the JAVA application ace.map creator 1.0 that we have developed. The fundamentally different structure of AB1700 transcriptome profiles requires re-evaluation, adaptation, or even redevelopment of many of the standard microarray analysis methods in order to avoid misinterpretation of the data on the one hand, and to draw full benefit from their increased specificity and sensitivity on the other hand. Our composite data model and the ace.map creator 1.0 application thereby not only present proof of the correctness of our parameter estimation, but also provide a tool for the generation of synthetic test data that will be useful for further development and testing of analysis methods.
منابع مشابه
Clustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...
متن کاملOptimizing Cost Function in Imperialist Competitive Algorithm for Path Coverage Problem in Software Testing
Search-based optimization methods have been used for software engineering activities such as software testing. In the field of software testing, search-based test data generation refers to application of meta-heuristic optimization methods to generate test data that cover the code space of a program. Automatic test data generation that can cover all the paths of software is known as a major cha...
متن کاملAttitudes towards English as an International Language (EIL) in Iran: Development and Validation of a New Model and Questionnaire
This study aimed at developing and validating a new model and instrument to explore attitudes of Iranian EFL learners towards English as an International Language (EIL). In so doing, the researchers followed several rigorous steps including extensive literature review, content selection, item generation, designing the rating scales and personal information part, Delphi technique, item revision,...
متن کاملStatistical analysis of the parameters influencing the mechanical properties of layered MWCNTs/PVC nanocomposites
In this paper, a new method is proposed for the production of MWCNTs/PVC (multi-walled carbon nanotubes/ polyvinyl chloride) nanocomposites. In this method, a spray is used to produce layers of carbon nanotubes within a PVC matrix. Various parameters influence the production of the nanocomposite and its mechanical properties. These parameters are studied separately and the effect of each of par...
متن کاملFirst transcriptome analysis of Iranian scorpion, Mesobuthus eupeus venom gland
Scorpions are generally an important source of bioactive components, including toxins and other small peptides as attractive molecules for new drug development. Mesobuthus eupeus, from medically important and widely distributed Buthidae family, is the most abundant species in Iran. Researchers are interesting on the gland of this scorpion due to the complexity of its venom. Here, we have analyz...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 5 شماره
صفحات -
تاریخ انتشار 2007